Multiple studies have focused on predicting the prospective popularity of an online document as a whole, without paying attention to the contributions of its individual parts. We introduce the task of proactively forecasting popularities of sentences within online news documents solely utilizing their natural language content. We model sentence-specific popularity forecasting as a sequence regression task. For training our models, we curate InfoPop, the first dataset containing popularity labels for over 1.7 million sentences from over 50,000 online news documents. To the best of our knowledge, this is the first dataset automatically created using streams of incoming search engine queries to generate sentence-level popularity annotations. We propose a novel transfer learning approach involving sentence salience prediction as an auxiliary task. Our proposed technique coupled with a BERT-based neural model exceeds nDCG values of 0.8 for proactive sentence-specific popularity forecasting. Notably, our study presents a non-trivial takeaway: though popularity and salience are different concepts, transfer learning from salience prediction enhances popularity forecasting. We release InfoPop and make our code publicly available: https://github.com/sayarghoshroy/InfoPopularity
translated by 谷歌翻译
肾脏是人体的重要器官。它保持体内平衡并通过尿液去除有害物质。肾细胞癌(RCC)是肾癌最常见的形式。大约90%的肾脏癌归因于RCC。最有害的RCC类型是清晰的细胞肾细胞癌(CCRCC),占所有RCC病例的80%。需要早期和准确的CCRCC检测,以防止其他器官进一步扩散该疾病。在本文中,进行了详细的实验,以确定可以在不同阶段诊断CCRCC的重要特征。 CCRCC数据集从癌症基因组图集(TCGA)获得。考虑了从8种流行特征选择方法获得的特征顺序的新型相互信息和集合的特征排名方法。通过使用2个不同的分类器(ANN和SVM)获得的总体分类精度来评估所提出方法的性能。实验结果表明,所提出的特征排名方法能够获得更高的精度(分别使用SVM和NN分别使用SVM和NN),与现有工作相比,使用SVM和NN分别使用SVM和NN进行分类。还要注意的是,在现有TNM系统(由AJCC和UICC提出的)提到的3个区分特征中,我们提出的方法能够选择其中两个(肿瘤的大小,转移状态)作为顶部 - 大多数。这确立了我们提出的方法的功效。
translated by 谷歌翻译
图像着色是计算机视觉中的一个众所周知的问题。但是,由于任务的性质不足,图像着色本质上是具有挑战性的。尽管研究人员已经进行了几次尝试制作着色管道自动化,但由于缺乏调理,这些过程通常会产生不切实际的结果。在这项工作中,我们试图将文本描述与要着色的灰度图像一起集成为辅助条件,以提高着色过程的忠诚度。据我们所知,这是将文本条件纳入着色管道中的首次尝试之一。为此,我们提出了一个新颖的深网,该网络采用了两个输入(灰度图像和相应的编码文本描述),并试图预测相关的颜色范围。由于各自的文本描述包含场景中存在的对象的颜色信息,因此文本编码有助于提高预测颜色的整体质量。我们已经使用不同的指标评估了我们提出的模型,并发现它在定性和定量上都优于最先进的着色算法。
translated by 谷歌翻译
在计算机视觉中,人类的姿势合成和转移与以前看不见的姿势的概率图像产生相关的概率图像产生。尽管研究人员最近提出了几种实现此任务的方法,但这些技术中的大多数直接从特定数据集中的所需目标图像中得出了姿势,这使得基础过程挑战在现实世界情景中应用于目标图像的生成是实际目标。在本文中,我们首先介绍当前姿势转移算法的缺点,然后提出一种新型的基于文本的姿势转移技术来解决这些问题。我们将问题分为三个独立的阶段:(a)文本构成表示,(b)姿势改进,(c)姿势渲染。据我们所知,这是开发基于文本的姿势转移框架的首次尝试之一,我们还通过为DeepFashion数据集的图像添加描述性姿势注释,从而引入了新的数据集DF-PASS。所提出的方法在我们的实验中产生了具有显着定性和定量得分的有希望的结果。
translated by 谷歌翻译
随着大型预训练的语言模型(例如GPT-2和BERT)的广泛可用性,最近的趋势是微调一个预训练的模型,以在下游任务上实现最新的性能。一个自然的示例是“智能回复”应用程序,其中调整了预训练的模型以为给定的查询消息提供建议的答复。由于这些模型通常是使用敏感数据(例如电子邮件或聊天成绩单)调整的,因此了解和减轻模型泄漏其调整数据的风险很重要。我们研究了典型的智能回复管道中的潜在信息泄漏漏洞,并引入了一种新型的主动提取攻击,该攻击利用包含敏感数据的文本中的规范模式。我们通过实验表明,对手可以提取培训数据中存在的敏感用户信息。我们探讨了潜在的缓解策略,并从经验上证明了差异隐私如何成为这种模式提取攻击的有效防御机制。
translated by 谷歌翻译
pla窃意味着从事他人的工作,而不为此归功于他们。窃是学术界和研究人员中最严重的问题之一。即使有多种工具可以在文档中检测窃,但其中大多数是特定于域的,旨在在英语文本中起作用,但pla窃不仅限于单一语言。孟加拉语是孟加拉国最广泛的语言,是印度第二大口语的语言,有3亿本人的母语和3700万本语言的人。窃检测需要大量的语料库进行比较。孟加拉文学的历史为1300年。因此,大多数孟加拉文学书籍尚未正确数字化。由于我们的目的没有这样的语料库,因此我们从印度国家数字图书馆收集了孟加拉文学书籍,并从中提取了全面的方法论并构建了我们的语料库。我们的实验结果发现,使用OCR,文本提取的平均准确性在72.10%-79.89%之间。 Levenshtein距离算法用于确定窃。我们已经构建了一个用于最终用户的Web应用程序,并成功地测试了孟加拉文本中的窃检测。将来,我们旨在构建一个具有更多书籍的语料库,以进行更准确的检测。
translated by 谷歌翻译
我们简要介绍了从实验神经科学的研究结果对生物学学习的共同假设,并以经常性神经网络的梯度学习效率对比。本评论中讨论的关键问题包括:突触可塑性,神经电路,理论实验划分和客观功能。我们在设计新的研究时,我们的建议与理论和实验神经科学家的建议有助于为这些问题带来清晰度。
translated by 谷歌翻译
数据挖掘中的许多基本问题可以减少到一个或多个NP-Colly组合优化问题。诸如量子和量子启发硬件的新技术的最新进展承诺,与使用通用计算机相比,诸如使用通用计算机而且需要以特殊形式进行建模的问题,例如以特殊形式建模的问题,例如诸如ising或二次无约会二进制优化的问题,以解决这些问题的大量加速(qubo)模型,以利用这些设备。在这项工作中,我们专注于重要的二进制矩阵分解(BMF)问题,这些问题在数据挖掘中具有许多应用。我们为BMF提出了两种QubBo配方。我们展示了如何容易地将聚类约束纳入这些配方。我们考虑的特殊用途硬件有限于它可以处理的变量数量,这在分解大矩阵时呈现出挑战。我们提出了一种基于采样的方法来克服这一挑战,允许我们分解大型矩形矩阵。除了这些方法之外,我们还提出了一种简单的基线算法,这些算法优于我们在几种情况下更复杂的方法。我们在富士通数字退火器中运行实验,在合成和实数据上,包括基因表达数据的量子启发的互补金属氧化物半导体(CMOS)退火器。这些实验表明,我们的方法能够生产比竞争方法更准确的BMF。
translated by 谷歌翻译
Unsupervised learning-based anomaly detection in latent space has gained importance since discriminating anomalies from normal data becomes difficult in high-dimensional space. Both density estimation and distance-based methods to detect anomalies in latent space have been explored in the past. These methods prove that retaining valuable properties of input data in latent space helps in the better reconstruction of test data. Moreover, real-world sensor data is skewed and non-Gaussian in nature, making mean-based estimators unreliable for skewed data. Again, anomaly detection methods based on reconstruction error rely on Euclidean distance, which does not consider useful correlation information in the feature space and also fails to accurately reconstruct the data when it deviates from the training distribution. In this work, we address the limitations of reconstruction error-based autoencoders and propose a kernelized autoencoder that leverages a robust form of Mahalanobis distance (MD) to measure latent dimension correlation to effectively detect both near and far anomalies. This hybrid loss is aided by the principle of maximizing the mutual information gain between the latent dimension and the high-dimensional prior data space by maximizing the entropy of the latent space while preserving useful correlation information of the original data in the low-dimensional latent space. The multi-objective function has two goals -- it measures correlation information in the latent feature space in the form of robust MD distance and simultaneously tries to preserve useful correlation information from the original data space in the latent space by maximizing mutual information between the prior and latent space.
translated by 谷歌翻译
Cashews are grown by over 3 million smallholders in more than 40 countries worldwide as a principal source of income. As the third largest cashew producer in Africa, Benin has nearly 200,000 smallholder cashew growers contributing 15% of the country's national export earnings. However, a lack of information on where and how cashew trees grow across the country hinders decision-making that could support increased cashew production and poverty alleviation. By leveraging 2.4-m Planet Basemaps and 0.5-m aerial imagery, newly developed deep learning algorithms, and large-scale ground truth datasets, we successfully produced the first national map of cashew in Benin and characterized the expansion of cashew plantations between 2015 and 2021. In particular, we developed a SpatioTemporal Classification with Attention (STCA) model to map the distribution of cashew plantations, which can fully capture texture information from discriminative time steps during a growing season. We further developed a Clustering Augmented Self-supervised Temporal Classification (CASTC) model to distinguish high-density versus low-density cashew plantations by automatic feature extraction and optimized clustering. Results show that the STCA model has an overall accuracy of 80% and the CASTC model achieved an overall accuracy of 77.9%. We found that the cashew area in Benin has doubled from 2015 to 2021 with 60% of new plantation development coming from cropland or fallow land, while encroachment of cashew plantations into protected areas has increased by 70%. Only half of cashew plantations were high-density in 2021, suggesting high potential for intensification. Our study illustrates the power of combining high-resolution remote sensing imagery and state-of-the-art deep learning algorithms to better understand tree crops in the heterogeneous smallholder landscape.
translated by 谷歌翻译